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Abstract 

The article exposed one original interpretation of the well-known statistical cor- 
relation and regression line, in the manner of ordinary geometry and stereometry. 
My goal was to support the thesis in my brochure “Analysis of liberty” in Serbian, 
listed at the end, where I slipped that such interpretation is obvious, but it turned 
out not to be found anywhere in the mathematical literature or on the Internet. 


1 Mean line 

Given are the two series of random variables A(a\, 02 , . . . , a n ), A' (a' 1: a' 2 , ■ ■ ■ , a' n ) and 
series of n G N units 17(1,1,. ..,1). They define three vectors a = oA, a' = oA 
and u = Ol} in a orthogonal Cartesian system of the coordinates 0£i£2---£n- The 
orthogonal projections of the points A and A! on the line OU are points M and M' 
respectively, as in figure [l] 



Figure 1: The right triangles OAM and OA'M 1 . 

As we know, the mean is the average of the numbers. If we mark the means /u(a) = a 
and //(a') = a 1 , where: 


a = 


a\ + a 2 + ■ ■ ■ + a r , 


n 


-/ _ a 'l + a 2 + • • • + a n 

n 


( 1 ) 
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then we can prove that the triangles OAM and OA' M' are rectangular, with the right 
angles in vertexes M and M ' , where: 


m = O] vi = a u, m 7 = Ol = a' u. ( 2 ) 

That is the statement of the following theorem. 

Theorem 1. The triangle OAM is right-angled with ZM = 90°. 

Proof. Calculate the scalar product: 

OaI • mA = (a, a, . . . , a) ■ (ai — a, — a , . . . , a n — a) = 

= n ■ (ai - a) + n ■ (a 2 - a) + . . . + /a ■ (a n - a) 

= n ■ (ai + a 2 + . . . + a n — nd ) = a • 0 = 0 . 

From OAt ■ mA = 0 follows OiA _L A/ A. that is ZM = 90°. □ 

Let us chose a point B on a straight line OA', so that its orthogonal projection on 
the line OU is point M. For similar, right-angled, triangles OAM and OA'M' we have 
the proportion OB : OA' = OM : OM\ then OB = a oA /a! or b = a a' /o'. Hence: 

b = oA = Oi, & 2 , • • • , b n ) = {o !\ , a 2 , a' n ), (3) 

so bk = §r o! k for all k = 1,2 ,k. From the mean line OM we can see the segment 
AB with angles tp = ZAOB and 7 = ZAMB, as shown in figure [lj 

Example 1. The series H(ll,20,5) and H'(10, 5,3) present in a Cartesian system. 
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Solution. On the figure[2]we see the right triangles OAM and barely OA'M' . The means 
are a = 12 and a' = 6, so the point B has coordinates B{ 20, 10, 6). 

The sides of the triangle OAM are: 

f a = OA = v/ll 2 + 20 2 + 5 2 = sj 546 , 

<j m = OM = Vl2 2 + 12 2 + 12 2 = \/432, 

{ s a = Ml = ^(11 - 12) 2 + (20 - 12) 2 + (5 - 12) 2 = y/lU, 

2 2 2 

and OA = OM~ + MA , so Pythagorean theorem stands. The triangle OAM is right. 
The sides of the triangle OA' M' are: 

f a' = CM 7 = VlO 2 + 5 2 + 3 2 = v/134, 

\ m' = OM 7 = \/6 2 + 6 2 + 6 2 = >/I08, 

{ s a > = WA 7 = ^/(lO- 6) 2 + (5-6) 2 + (3-6) 2 = y/26, 

and 0+ 2 = OM 2 + M A 2 . The triangle OA' M' is right too. □ 

Example 2. The series A(— 1,2,3) and + 7 (4,— 2,2) present in a Cartesian system. 



Solution. The solution is on the figure [3j The mean of the points A and A! are the same 
a = a' = |, so For the sides of the triangle OAM we find: 


OA = \J ( — l) 2 + 2 2 + 3 2 = vAa 



OM = V % 2 + (|) 2 + (t) 2 = ;/?, 
VA =S J(-1- l) 2 + (2 -l) 2 + (3- |) 2 



so OA 2 = OM 2 + M A 2 . 
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For the second triangle: 


' 04 7 = v/4 2 + (-2) 2 + 2 2 = v/24 = ^/f , 

< om = v/(D 2 +(i) 2 +(i) 2 = 

t M 7 = ^(4 - |) 2 + (-2 - |) 2 + (2 - |) 2 = yf 

and again 0^4'" = 0M“ + MA' 2 . 


2 Correlation triangle 

The triangle ABM call correlation triangle. The sides of the triangle are vectors 

{ mA = s a = (ai - a, 02 - a, . . . , a n - a), 

M~A = s b = (bi-b,b 2 -b,...,b n - b), 

AB = M A — mA = Sft — s a . 

On the other side, from the triangle OAB we have: 

AB = () A — O A = {b\ — ai, b2 — a2, ■ ■ ■ ,b n — a n ) = b — a, 

and also s& — s a = b — a. The cosine rule for the triangles gives: 

f AB 2 = WA 2 + Mb 2 - 2 • MA • IB • COS7, 

| A^ 2 = OA 2 + OB 2 -2- OA- OB -cos p. 

From the right triangles O AM and OBM follows: 

( OAI = a sin a MA = a cos a, 

\ OB = a sin a/ sin / 3 , MB = a sin a cot / 3 , 

where a = OA, a = AO AM, (3 = AOBM. Therefore: 

MA 2 + MB 2 - 2 • WA • MB • C 0 S 7 = OA 2 + OB 2 - 2 ■ OA ■ OB ■ cos p, 

s a + s b ~ 2 s a Sb cos 7 = a 2 + b 2 — 2 ab cos p, 

2abcosp — 2s a SbCOS'y = (a 2 — s 2 ) + (b 2 — s 2 ), 

2(abcosp — s a Sb cos 7) = m 2 + m 2 , 
ab cos p — s a Sb cos 7 = m 2 , 
or by scalar product of the vectors: 

a • b = m • m + s a • Sf,. 

The marks are shown in Figure [l] and m = OAl. 


Especially, the cosine of angle between the vectors s a and s a / is: 

Sa-s fe _ J2k( a k ~ a)(b k - b) 


cos 7 = 


s b 


\/j2k (ak-a) 2 \Jj2k {h-b) 2 


( 9 ) 


The same result for gamma is if use s a / instead s&. That is Pearson coefficient r of linear 
correlation. On the other side: 


cos ip = 


a • b 

a| Ibl 


/u i- dffik 


It is and the cosine angle between the given vectors a and ah 
Example 3. Find the angles for the examples [I] and ^ 

Solution. In the example |TJ 

/ cos ^ = = °- 202048 ’ 

I nnc tr 11-20+20- 10+5-6 pi oqi OOQ , n 

\ COS( ^ - v / 11 2 +20 2 +5V20 2 +10 2 +6 2 - 0.831829, ip 

so the angle gamma is more than twice larger than angle phi. 

In the example [2j 


7 


cos 7 = , — 2 —^ r ^ = = 0.576557, 7 


-^26/3-^56/3 

cos<<c> = 711724 = — 0 - 109109 , 
Gamma is almost twice larger than phi. 




55° 

96° 


78° 

34° 


( 10 ) 


( 11 ) 


(12) 


□ 


Example 4. The first vector a has the percentage of the students that are on social 
welfare in 12 schools, the second a' holds the percentage of the students that use the 
helmet while riding the 


a: 50 

11 2 

19 26 13 81 

51 

11 

2 

19 

25 


a': 22.1 

35.9 51.9 

22.2 42.4 5.8 3.6 

21.4 

55.2 

33.3 

32.4 

38.4 


Calculate angles (9) and (10). 








Solution. For a 

= 370/12 = 30.833 and d! = 370.6/12 

= 30.883, we get: 




s a : 19.2 

-19.8 -28.8 

-11.8 -4.8 42.2 

50.2 

20.2 

-19.8 

-28.8 

- 11.8 

-5.8 

s 0 ': - 8.8 

5.0 27.0 

-8.7 11.5 -25.1 - 

■27.3 

-9.5 

24.3 

2.4 

1.5 

7.5 


Then we calculate cosine (9) and (10): 


cos 7 = 
cos ip = 


-4231.14 


V7855.68V3159.68 
7195.7 . 

V19264V14605.0 " 


= -0.849266, 
0.428992, 


7 

T 


148°, 

65°. 


(13) 


The angle gamma is more than twice larger than phi, but the correlation coefficient 
r = —0.85 shows strong negative correlation. If poorer, the students use the helmet 
less. □ 

1 From the San Jose State University, then in |ll. 


5 


3 Regression line 

In the correlation triangle MAB the line BC , where C € MA , is perpendicular to the 
side MA, as you can see on the figure [4j 


0 



B 


Figure 4: Line BC is perpendicular to MA. 
From the previous, we know: 


MC = MB ■ cosy = Sb cosy, MA = |s a | = s a , 


which now gives: 


Sb 


S b Sq, • S b 


x : y = MC : MA = — cos 7 = 

Sa S a |S a | \ S b \ 

S a-Sb J2k( a k~a)(b k -b) 

X '-y= To- = ’ 

l s a | 2 Efc(«fc-a ) 2 

with means b = a. This is the regression line equation. 


(14) 


(15) 
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